TFBS identification based on genetic algorithm with combined representations and adaptive post-processing

Authors

  • Tak-Ming Chan
  • Kwong-Sak Leung
  • Kin-Hong Lee
Abstract

MOTIVATION: Identification of transcription factor binding sites (TFBSs) plays an important role in deciphering the mechanisms of gene regulation. Recently, GAME, a Genetic Algorithm (GA)-based approach with iterative post-processing, has shown superior performance in TFBS identification. However, the basic GA in GAME is not elaborately designed, and may be trapped in local optima in real problems. The feature operators are only applied in the post-processing, but the final performance heavily depends on the GA output. Hence, both effectiveness and efficiency of the overall algorithm can be improved by introducing more advanced representations and novel operators in the GA, as well as designing the post-processing in an adaptive way.

RESULTS: We propose a novel framework GALF-P, consisting of Genetic Algorithm with Local Filtering (GALF) and adaptive post-processing techniques (-P), to achieve both effectiveness and efficiency for TFBS identification. GALF combines the position-led and consensus-led representations used separately in current GAs and employs a novel local filtering operator to get rid of false positives within an individual efficiently during the evolutionary process in the GA. Pre-selection is used to maintain diversity and avoid local optima. Post-processing with adaptive adding and removing is developed to handle general cases with arbitrary numbers of instances per sequence. GALF-P shows superior performance to GAME, MEME, BioProspector and BioOptimizer on synthetic datasets with difficult scenarios and real test datasets. GALF-P is also more robust and reliable when further compared with GAME, the current state-of-the-art approach.

AVAILABILITY: http://www.cse.cuhk.edu.hk/~tmchan/GALFP/.
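The two representations named in the abstract can be illustrated with a small sketch. This is not the GALF-P implementation: the function names, the majority-vote consensus, and the match-count scoring are assumptions chosen for illustration, and the local filtering, pre-selection, and post-processing steps are not modelled. It only shows how a position-led individual (one site start per sequence) and a consensus-led individual (a motif string) can be converted into each other.

```python
from collections import Counter


def consensus_from_positions(seqs, positions, width):
    """Position-led -> consensus-led (illustrative assumption).

    Cuts one site of the given width from each sequence and returns the
    majority base of every aligned column.
    """
    sites = [s[p:p + width] for s, p in zip(seqs, positions)]
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*sites))


def positions_from_consensus(seqs, consensus):
    """Consensus-led -> position-led (illustrative assumption).

    For each sequence, returns the start of the window that agrees with
    the consensus at the most positions (first such window on ties).
    """
    w = len(consensus)

    def matches(window):
        return sum(a == b for a, b in zip(window, consensus))

    return [
        max(range(len(s) - w + 1), key=lambda p: matches(s[p:p + w]))
        for s in seqs
    ]


if __name__ == "__main__":
    # Toy sequences with the planted motif TGACA (hypothetical data).
    seqs = ["ACGTTGACAGG", "TTTGACATCCA", "GGCATGACATA"]
    noisy_positions = [3, 2, 4]  # first site is off by one
    consensus = consensus_from_positions(seqs, noisy_positions, 5)
    print(consensus, positions_from_consensus(seqs, consensus))
```

Alternating between the two views lets a slightly misaligned set of positions be corrected through its consensus, which is one intuition for why combining the representations can help a search escape poor local alignments.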

Similar resources

Prediction of soil cation exchange capacity using support vector regression optimized by genetic algorithm and adaptive network-based fuzzy inference system

Soil cation exchange capacity (CEC) is a parameter that represents soil fertility. Because CEC is difficult to measure, pedotransfer functions (PTFs) are routinely applied to predict it from soil physicochemical properties that can be easily measured. This study developed the support vector regression (SVR) combined with genetic algorithm (GA) together with the adaptive network-based fuzzy infe...

A Novel Methodology for Structural Matrix Identification using Wavelet Transform Optimized by Genetic Algorithm

With the development of technology and the increase of human dependency on structures, healthy structures play an important role in people's lives and communications. Hence, structural health monitoring has attracted strong interest in recent decades. Improvements in measuring instruments have made signal processing a powerful tool in structural health monitoring. Wavelet transform invention causes a g...

IIR System Identification Using Improved Harmony Search Algorithm with Chaos

Because the error surface of adaptive infinite impulse response (IIR) systems is generally nonlinear and multimodal, conventional derivative-based techniques fail when used in adaptive identification of such systems. In this case, global optimization techniques are required in order to avoid the local minima. Harmony search (HS), a music-inspired metaheuristic, is a recently ...

Adaptive Neuro Fuzzy Sliding Mode Based Genetic Algorithm Control System to Control of a pH Neutralization Process

In this paper, an adaptive neuro fuzzy sliding mode based genetic algorithm (ANFSGA) control system is proposed for a pH neutralization system. In pH reactors, determination and control of pH is a common problem concerning chemical-based industrial processes due to the non-linearity observed in the titration curve. An ANFSGA control system is designed to overcome the complexity of precise control o...

Airfoil Shape Optimization with Adaptive Mutation Genetic Algorithm

An efficient method for scattering Genetic Algorithm (GA) individuals in the design space is proposed to accelerate airfoil shape optimization. The method used here is based on the variation of the mutation rate for each gene of the chromosomes by taking feedback from the current population. An adaptive method for airfoil shape parameterization is also applied and its impact on the optimum desi...

Journal:
  • Bioinformatics

Volume 24, Issue 3

Pages -

Publication date 2008